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ABSTRACT 


This  paper  surveys  a  number  of  efficient  randomized  algorithms  for  selection 
and  sorting  which  we  derive  from  inefficient  deterministic  specifications.  Along 
with  these  derivations,  we  simultaneously  derive  bounds  on  the  probability  distri¬ 
bution  of  the  sequential  and  parallel  time  cost  of  these  algorithms. 

There  are  several  potential  benefits  of  this  work.  We  develop,  for  the  first 
time,  geneal  techniques  for  deriving  randomized  algorithms  from  deterministic 
specifications.  Previous  derivations  only  considered  deterministic  algorithms. 
Furthermore,  our  derivations  encompass  many  known  sequential  and  parallel  ran¬ 
domized  algorithms  for  selection  and  sorting  which  required  separate  proofs  and 
probabilistic  analysis. 

Also,  we  present  a  new  randomized  comparison  sorting  algorithm  that  takes 
O(loglogn)  time  and  uses  nl+t  processors  to  sort  n  keys. 


1.  INTRODUCTION 


1.1  Meaning  of  Derivation 

A  growing  body  of  computer  science  literature  is  concerned  with  deriving 
efficient  algorithms  for  given  mathematical  specifications  which  would  be 
inefficient  to  execute  as  such.  For  example,  [Scherlis  80]  derives  a  family  of  parsing 
algorithms  and  [Reif  and  Scherlis  84]  derive  a  family  of  efficient  depth  first  search 
algorithms  for  various  connectivity  problems  of  graphs  and  digraphs.  The  process 
of  deriving  algorithms  from  given  specifications  could  be  thought  of  as  restating 
the  specifications  in  steps,  using  general  algorithm  design  techniques  and  problem 
specific  knowledge,  such  that  the  specification  at  any  step  of  derivation  reflects  an 
improvement  in  the  efficiency  of  its  execution  over  the  one  in  the  previous  step. 
The  advantage  of  this  derivation  process  is  manyfold.  These  derivations  serve  as 
constuctive  proofs  for  the  correctness  of  a  family  of  algorithms.  Also,  they  enable 
us  to  understand  existing  efficient  (and  possibly  complicated)  algorithms  for  the 
problem  at  hand  better,  than  merely  reading  the  proofs  of  their  correctness.  In  this 
paper  we  derive  randomized  algorithms  for  selection  and  sorting. 

1.2  Randomized  Algorithms 

A  randomized  algorithm  A  defines  a  mapping  from  an  input  domain  D  to  a 
set  of  probability  density  functions  over  some  output  domain  V .  For  each  input 
x  £D,  A{x)  :  D— ►[0,1]  is  a  probability  density  function,  where  A(z)  (y)  6  [0,1]  is 
the  probability  of  outputting  y  given  input  x.  In  order  for  A(x)  to  represent  a  pro¬ 
bability  mass  function,  we  require 

£]A(x)  (y)  =  1,  for  each  x£  D . 
ye# 

A  mathematical  semantics  for  randomized  algorithms  is  given  in  [Kozen  80]. 
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Two  different  types  of  randomised  algorithms  can  be  found  in  the  literature: 
l)thoee  algorithms  which  always  output  the  correct  answer  but  whose  run  time  is 
a  random  variable  with  a  specified  mean,  and  2)those  which  output  the  correct 
answer  with  some  probability  (the  probability  space  being  the  set  of  all  possible 
inputs).  For  example,  the  randomized  sorting  algorithm  of  [Reischuk  81]  is  of  the 
first  type  and  the  primality  testing  algorithm  of  [Rabin  76]  is  of  the  second  type. 
In  general,  the  use  of  probabilistic  choice  in  algorithms  to  randomize  them  has 
often  lead  to  great  improvements  in  their  efficiency.  The  randomized  algorithms 
we  derive  in  this  paper  will  be  of  the  first  type. 

1.3  Deriving  Randomised  Algorithms 

No  previous  paper  has  given  derivations  of  randomized  algorithms  (for  any 
problem).  We  begin  with  the  mathematical  specifications  of  selection  and  sorting 
problems.  We’ll  encode  these  specifications  into  canonical  algorithms.  Various  algo¬ 
rithms  found  in  the  literature  for  these  problems  will  then  be  derived  as  special 
cases  of  these  canonical  algorithms.  The  former  algorithms  result  from  specializing 
and/or  modifying  one  or  more  of  the  steps  in  the  canonical  algorithms. 

To  start  with,  we  derive  and  analyze  a  random  sampling  algorithm  for 
approximating  the  rank  of  a  key  (in  a  set).  This  random  sampling  technique  will 
serve  as  a  building  block  for  the  selection  and  sorting  algorithms  we  derive.  We 
analyze  the  run  time  for  both  the  sequential  and  parallel  execution  of  the  derived 
algorithms. 

1.4  An  O(Ioglogn)  Time  Sorting  Algorithm 

Many  optimal  parallel  comparison  sorting  algorithms  are  available  in  the 
literature.  These  algorithms  are  optimal  in  the  sense  that  the  product  of  time  and 
processor  bounds  for  these  algorithms  equals  the  lower  bound  of  the  run  time  for 


sequential  comparison  sorting.  These  processors  run  in  time  O(logn).  Some  of 
these  algorithms  are  l)Reischuk’s[Reischuk  81]  randomized  algorithm,  2)AKS 
deterministic  algorithm[AKS83],  3)Column  Sorting  algorithm  [Leighton  83], 
4)FLASH  SORT  algorithm[Reif  and  Valiant  83].  The  first  three  algorithms  run  on 
the  PRAM  model  whereas  the  fourth  algorithm  runs  on  the  fixed  connection  model. 
We  give  a  new  (non  optimal)  sorting  algorithm  that  runs  in  time  O(Ioglogn)  time 
and  which  uses  0(n1+<)  processors,  for  any  e>0. 

1.5  Organisation  of  this  Paper 

In  section  2  we  define  the  selection  and  sorting  problems  and  our  parallel 
comparison  tree  machine  model.  In  section  3  we  give  and  analyze  an  algorithm  for 
computing  the  rank  of  a  key  approximately.  We  also  state,  in  this  section,  an 
important  result  from  sampling  theory  which  will  be  used  throughout  the  rest  of 
the  paper.  Finally,  in  sections  4  and  5  we  derive  and  analyze  various  randomized 
algorithms  for  selection  and  sorting.  In  section  5  we  also  give  a  new  comparison 
sorting  algorithm  that  runs  in  time  O(loglogn)  time. 

2.  COMPARISON  PROBLEMS  AND  PARALLEL  MACHINE  MODELS 

2.1  Comparison  Problems 

Let  X  be  a  set  of  N  distinct  keys.  Let  <  be  a  total  ordering  over  A'.  For 
each  key  x£X  define 

ranker, A')  =  I  (a/GA'  /  J <x)  I  +1. 

For  each  index  i,  we  define  sc\prt.(i  ,X)  to  be  that  key  x£X  such  that 

i  =  rank(z, AT).  Also  define 


saUpO  =  (x^xj,  .  .  .  ,xN) 

where  x,  «  select (i  JT),  for  i=l,...,N.  As  defined,  these  functions  are  expensive  to 
compute.  For  example,  the  sort  definition  requires  N 2  comparisons. 

2.2  Parallel  Comparison  Models 

In  the  sequential  comparison  tree  model  of  [Knuth  73]  a  single  step  at  a  node 
of  the  tree  consists  of  a  comparison  of  two  keys.  The  outcome  of  this  comparison 
takes  the  execution  to  a  child  of  this  node.  The  leaves  of  the  tree  provide  output 
values.  The  run  time  in  this  model  is  the  number  of  nodes  visited  on  a  given  execu¬ 
tion.  A  distinct  tree  is  allowed  for  each  input.  In  a  randomized  comparison  tree 
model  execution  from  any  node  branches  to  a  random  child  depending  on  the  out¬ 
come  of  a  coin  tossing. 

[Valiant  75]  describes  a  parallel  comparison  tree  machine  model  which  is 
similar  to  the  sequential  tree  models,  except  that  multiple  comparisons  between 
keys  are  allowed  on  each  step.  Thus  a  comparison  tree  machine  with  p  processors 
is  allowed  a  maximum  of  p  comparisons  at  each  node,  which  are  executed  simal- 
taneously.  We  allow  our  parallel  comparison  tree  machines  to  be  randomized, 
with  random  choice  nodes  as  described  above. 

2.3  Parallel  RAM  Models 

More  refined  machine  models  of  computation  also  take  into  account  storage 
and  arithmetic  steps.  The  sequential  random  access  machine  (RAM)  described  in 
[Aho,  Hopcroft,  and  Ullman  76}  allows  a  finite  number  of  register  cells  and  also 
infinite  global  storage.  A  single  step  of  the  machine  consists  of  an  arithmetic 
operation,  a  comparison  of  two  keys,  reading  off  the  contents  of  a  global  cell  into  a 
register,  or  writing  the  contents  of  a  register  into  a  global  memory  cell. 
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The  parallel  version  of  RAM  proposed  by  [Fortune  and  Wylie  78]  allows  mul¬ 
tiple  RAMs  to  be  generated  from  a  single  original  RAM  by  execution  of  a  fork 
operation.  This  model,  known  as  PRAM,  allows  multiple  concurrent  reads  but 
prohibits  concurrent  writes.  WRAM  model,  which  is  a  variation  of  PRAM,  permits 
concurrent  reads  and  concurrent  writes.  There  are  three  variations  of  WRAM 
depending  on  how  the  write  conflicts  are  resolved. 

The  models  we  employ,  in  this  paper,  for  various  algorithms  will  be  the  ones 
used  by  the  corresponding  authors. 


3.  RANDOM  SAMPLING 


"  "  •  '  •  '  -  "  •  ■■■  V  .-J  gr-TC  V  WW'.'  IT  V j  .■  '  y  tf  ."■«  ".■  ■;«  v -  «  v -  .  ., -  V 


problem:  "Given  a  set  X  and  a  random  igX.  Also,  S  is  a  random  subset  (or  sam¬ 
pling)  of  X.  If  the  number  of  elements  <x  in  S  is  l,  how  many  elements  in  X  are 
<*?"  Let  r,=  rank(select(«,S),  X).  In  the  next  subsection  we’ll  obtain  the  distribu- 

N 

tion  of  r{  and  show  that  Mean(r,)  =  —  . 

3.2  Distribution  of  r,- 

Let  y„y2,  .  .  .  ,  y,be  a  random  sample  from  a  continuous  distribution  with 
density  function  f(y).  If  ^)<^)<  <y(#,  is  the  sorted  order  of  the  sam¬ 

ple,  then  y(i)  is  called  the  »th  order  statistics  of  the  sample  Yif  .  .  .  ,  Y,.  If 
/(y)  =  1  in  [0,1]  and  0  elsewhere,  then  the  density  function  of  the  ith  order  statis¬ 
tics  [Wilks  76]  is  given  by 

/<(»«))-  (i-iyl-iji  ylo1(1-V(.)),~’  - 

which  is  a  Beta  distribution  with  parameters  («,s  — t  +  l).  Therefore,  asymptotically, 

—r  has  a  Beta  distribution  with  parameters  (i',«-«+l),  immediately  implying  that 
N 

Mean(r,)=>  ^  and  Var(r,)=— ^ — l  +  1I  ■  jV2  .  From  this  Beta  distribution  we  can 

V  *+l  V  (s+l)2(s+2) 

also  obtain  the  following  confidence  interval  on  r,-. 


Lemma  3.2  For  every  a,  Prob.  Ir, 


<  N  a  for 


some  constant  c . 

Proof  We’ll  assume  log/V=o(s)  in  the  following  proof.  Also,  this  proof  is  given  in 
two  cases  on  the  parameters  of  the  Beta  distribution. 
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Caael  Both  »  and  a— »+l  are  >  Va/IogN. 

Let  Z  be  a  random  variable  with  probability  density  function  /(z),  mean  fi  ,  and 
variance  o*.  It  is  a  known  fact  from  statistics  that  if  the  skewness  of  Z  is  0,  then 

can  be  approximated  by  the  normal  variate  N(0,1).  1  i  skewness  of  a  Beta 

distribution  with  parameters  (a  ,  0)  is  0  when  either  a  =  0  or  both  a  and  0  are  oo. 
Therefore,  when  both  »  and  a— i+l  are  >Ve/log N ,  asymptotically,  the  skewness 

of  tf  tends  to  0,  making  the  approximation  of  — — —  by  the  standard  normal  vari- 

<T 

ate  meaningful.  And  hence, 


JV2  N 

Notice  that  Var(r,)  <  — Now,  substituting  Mean(r,)  *=  » -  and 

2a  a+1 


Var(r-,-)  <  — —  ,  we  get  the  desired  result. 


Case2  Either  »  or  a—  i+l  is  <  Va/logN. 

The  case  i  <  Va/log N  is  identical  to  the  case  (a— i+l)  <  Va/logN.  So,  we’ll 
/*  consider  only  the  case  i  <  Va/logN. 

If  kltk2,...,k,  are  the  elements  of  the  random  sampling  set  S  in  sorted  order,  then 
these  elements  divide  the  set  X  into  (a+1)  subsets  Ni,X2,...,X,+j  where 
Xi  —  {x^AT  /  x^tj,  Xf  =  {x€AT  /  *,_t<x<*,},  for  i=2,...,a  and 
Af,+i  =  {z&X  /  x>k,}.  In  terms  of  the  cardinalities  of  these  subsets,  r,-  can  be 

expressed  as  \X^  for  «*l,...,a.  It  can  be  shown  that(see  lemma  3.3  below)  the 
i-i 
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maximum  cardinality  of  any  subset  X{  is  <  — logN,  with  probabilty  greater  than 


■  N, 


l—0(N~a)  (for  any  a).  This  result  immediately  implies  that  r,-  <  i — logN  with 


probability  >  1  —  0(N~a)  thus  proving  our  claim. 


Lemma  3.3  A  random  5 CAT  of  size  a  divides  X  into  a+1  subsets  as  explained 


a  N, 


above.  The  maximum  cardinality  of  any  of  the  resulting  subsets  is  < - log,  A 

2  a 


with  probability  greater  than  1— N~°.  (lAf1=JV). 


Proof 

First  we’ll  compute  the  probability  that  at  least  one  subset  is  of  size  at  least  v. 
Realize  that  the  selection  of  a  random  subset  S  of  X  results  in  a  random  (a+l)- 


partition  of  X.  Total  number  of  (a+l)  partitions  of  X  is 


N- 1 
a 


.  Number  of  parti- 


N-v-l 

a 


tions  of  X  that  have  at  least  one  part  of  size  >v  is 
Therefore,  the  probability  that  there  is  at  least  one  part  which  is  of  size  at  least  v 


is 


p  = 


N-v- 1 
a 


N—l 

a 


Using  Stirling’s  approximation, 


p  (N-v- 1)*— 1  (N-a-  1)N—' 
[N-a-v- i)W~— — >  (/V- l)N-‘ 


1- 


v+1 


N 


N-v- 1 


1- 


a  +  1 


\N—v—\ 


N 


1- 


a+v+1 


\N-i-t-l 


N 


N-] 


U 11  <C  rff  1 1 . .  j  1 1  ■  n  |J  «HJ ,  J  U  j ,  J I  ,J  I J  vm  mnvr  w? 


Using  the  fact 


I 


P  <  exp 


— 2v» 
N 


jf  v^2.2L\0i  N  then  p  <  N~°. 

2«  — 

i.e.,  the  probability  that  none  of  the  resulting  subsets  is  of  size  greater  than 
~~ logeiV  is  >  which  is  our  claim. 

it  8 


This  completes  our  proof  of  Lemmas  3.3  and  3.2.  These  Lemmas  will  be  used 
repeatedly  here  after. 


4.  DERIVATION  OF  RANDOMIZED  SELECT  ALGORITHMS 

4.1  A  Summary  of  Select  Algorithms 

Let  X  be  a  set  of  N  keys.  We  wish  to  derive  efficient  algorithms  for 
select(i,X)  where  Recall  we  wish  to  get  the  correct  answer  always  but 

the  run  time  may  be  a  random  variable.  We  display  a  canonical  algorithm  for  this 
problem  and  then  show  how  select  algorithms  in  the  literature  follow  as  special 
cases  of  this  canonical  algorithm. 


algorithm  canselect; 
hegin 

select  a  bracket  (i.e.,  a  subset)  B  in  X  such  that 
select(«,X)  lies  in  this  bracket  with  very  high 
probability; 

Let  «i  be  the  number  of  keys  in  X  less  than  the 
smallest  element  in  B; 
return  canselect(i— »i  ,  B) 

end; 


Select  algorithm  of  [Hoare  61]  chooses  a  random  eplitter  key  k  £  X,  and 

recursively  considers  either  the  low  key  set  or  the  high  key  set  based  on  where  the 

ith  element  is  located.  And  hence,  B  for  this  algorithm  is  either  (z  6  X  /  x  <  k) 

or  {x  £  X  /  x  >  k)  depending  on  which  set  contains  the  ith  largest  element  of  X. 

i  i  N 

1  B  I  for  this  algorithm  is  —  for  some  constant  c . 

On  the  other  hand,  select  algorithm  of  [Floyd  and  Rivest  75]  chooses  two 
random  splitters  /tj  and  Jfc2  and  sets  B  to  be  {x  6  X  /  k\  <  x  <  k2}.  k j  and  k2  are 
chosen  properly  so  as  to  make  I  B  I  =  0(Na),  a  <  1.  We’ll  analyze  these  two 
algorithms  in  more  detail  now. 


4.2  Hoare’a  Algorithm 

Detailed  version  of  Hoare’s  select  algorithm  is  given  below. 


algorithm  Hselect(«  ,X); 

(assert  1  <  «  <  I  X  I ) 

hegin 

if  X  =  {z}  thpn  return  z; 

Choose  a  random  splitter  k  (-  X: 

Let  B  =  {z  €  X  /  x  <  *}; 

if  |  B  I  >  i  then  return  Hselect(t',B) 

else  return  Hselect(t  —  IbI  ,  X  —  B) 

end; 


Let  Tp  (i,N)  be  the  expected  parallel  time  of  Hselect(«,X)  using  at  most  p 
simaltaneous  comparisons  at  any  time.  Then  the  recursive  definition  of  Hselect 
yields  the  following  recurrence  relation  on  Tp  ( i,N ). 


Tp  (i,N)  =  -  +  -Jr  E  ^  N-j)  +  £  Tp  (»,/)  . 


An  induction  argument  shows 


and 


Tj  (*>N)  <  2iV  +  min(i,7V— »)  +  o(N). 

To  improve  this  Hselect  algorithm,  we  can  choose  k  such  that  B  and  X  —  B 
are  of  approximately  the  same  cardinality.  This  choice  of  k  can  be  made  by  fusing 
samplerank(  into  Hselect  as  follows. 


algorithm  sampleselect,  (t  X); 

(assert  1  <  *  ,  i  <  Ixj) 

hegin 

if  X  =  {z}  then  return  x; 

Choose  a  random  sample  set  5  C  X  of  size  «; 
Let  k  **  select([«/2j,  S); 

Let  B  =  {z  £  X  /  x  <  jfc>; 

iflsl  >  i  then  return  sampleselect,  (i,B) 

else  return  sampleselect,(»  —  |f?l  ,  X  —  B) 

end; 


This  algorithm  can  easily  be  analyzed  using  Lemma  3.2. 


4.3  Algorithm  of  Floyd  and  Rivest 

As  was  stated  earlier,  this  algorithm  chooses  two  keys  Jt,  and  k2  from  X  at 
random  to  make  the  size  of  its  bracket  B=0(N^)  ,  P<1.  The  actual  algorithm  is 


algorithm  FRselect(« ,X); 
hegin 

if  X  =  {1}  then  return  z; 

Choose  ,k2  £  X  such  that  kt  <  k2  ; 

Let  =  rank(/tl  ,  X)  and  r2  =  rank (k2  ,  X); 
if  rj  >  i  then  FRselect(«',{z  £  X  /  z  <  J tj) 
else  if  r2  >  »  then  FRselect(»  — rj  ,  {z  £  X  /  Jtj  <  z  <  k2}) 
else  FRselect(i-r2  ,  {x  £  X  /  z  >  Ar2}) 

end; 


Let  Tp  (t  ,N)  be  the  expected  run  time  of  the  algorithm  FRselect(»',A’’)1  allow¬ 
ing  at  most  p  simaltaneous  comparisons  at  any  time.  Notice  that  we  must  choose 
*1  and  k2  such  that  the  case  rt  <  1  <  r2  occurs  with  high  likelyhood  and  r2  —  rx  is 
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not  too  large.  This  is  accomplished  in  FRselect  as  follows. 


Choose  a  random  sample  S  C.  X  of  size  a.  Set  i fc,  to  be 

select  ~  $  and  set  k2  to  be  select  +  6,  S .  If  the  parameter  6 

is  fixed  to  be  fVdos  logiVl,  for  some  constant  d,  then  by  lemma  3.2, 
Porb.[r,  >  «]  <  N~°  and  Prob.[r2  <  «']  <  N~°.  Let  Tp  (— ,«)  =  max  Tp  (/,«).  The 

resulting  recurrence  for  the  expected  parallel  run  time  with  p  processors  is 


T,  ( i,N )  <  y  +  Tp  (-.«) 

+  Prob.[ri  >  »']  X  Tp  (i.r,) 

+  Prob.[«  >  r2]  X  Tp  (t-r,  ,  N~r2) 

+  Prob.(rj  <  «  <  r2]  X  Tp  (i-r,  ,  r2-r,) 

<  y  +  Tp  (-,«)  +  2  N~°  X  N  +  Tp 


Note  that  1,  and  k2  are  chosen  recursively.  If  we  fix  da  <  3  and  let 
a  =  N2/3  logTV,  the  above  recurrence  yields  (Floyd  and  Rivest  75] 

T,  (i,N)  <  N  +  min  (i,N—i)  +  0(s). 

Observe  that  if  we  have  A^2  processors,  we  can  solve  the  select  problem  in  one  time 
unit,  since,  all  possible  pairs  of  keys  can  be  compared  in  one  step.  This  implies 
that  Tp  (i,N)  =  1  for  p  >  N2.  Also,  from  the  above  recurrence  relation, 

7*  (i,N)  <  0(1)  +  Tn  (-,VjV)  =  0(1), 

as  is  shown  in  [Reischuk  81]. 
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5.  DERIVATION  OF  RANDOMIZED  SORTING  ALGORITHMS 


5.1  A  Canonical  Sorting  Algorithm 

The  problem  is  to  sort  a  given  set  (X)  of  N  distinct  keys.  The  idea  behind  the 
canonical  algorithm  is  to  divide  and  conquer  by  splitting  the  given  set  into  (say)  «] 
disjoint  subsets  of  almost  equal  cardinality,  to  sort  each  subset  recursively,  and 
finally  merge  the  resultant  lists.  A  detailed  statement  of  the  algorithm  follows. 


algorithm  cansort(X); 
hpgin 

ifX={x}ih£n  return  X; 

Choose  a  random  subset  5  C  X  of  size  «  ; 

Let  Sj  be  sorted  S; 

As  explained  in  section  3.2,  St  divides  X 

into  «+l  subsets  X\  ,  X2  ,  •  •  •  t  %•+ 1  > 

return  cansort(Xi)  .  cansort(X2) . cansort(X,+I); 

end; 


Now  we’ll  derive  various  sorting  algorithms  from  the  above. 

5.2  Hoare’s  Sorting  Algorithm 

When  a  =  1  we  get  Hoare’s  algorithm.  Hoare’s  sorting  algorithm  is  very  much 
similar  to  his  select  algorithm.  Choose  a  random  splitter  k  6  X  and  recursively 
sort  the  set  of  keys  {x£X  /  x<ifc}  and  {x€X  /  x>*}. 

algorithm  quicksort(X); 

hegin 

if  Ixl  =  1  then  return  X  ; 

Choose  a  random  k£X) 

return  quicksort({x£X  /  x<k})  .  (k)  .  quicksort({x£X  /  x>£}); 

end; 


Let  Tx  (N)  be  the  number  of  sequential  steps  required  by  quicksort(X)  if  1x1  =  N. 
Then, 

T I  (N)  <N- 1  +  ^7  E  (r,  (t-1)  +  r,  (7V-0)  <  2NlogN. 

/V  i-1 
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A  better  choice  for  k  will  be  sampleselect,  ([N/2J,  N).  With  this  modification, 
quicksort  becomes 

algorithm  samplesort,  (X); 
hegin 

if  IXl  =  1  then  return X; 

Choose  a  random  subset  SCX  of  size  s ; 

Let  k  =  select([«/2j,  S); 

return  samplesort,  ({z£X  /  X<k}  .  (/t)  .  samplesort,  ({arCX-  /  sr>/r}); 

end; 

By  Lemma  3.2, 

Prob.  [lrank(Jt,X)  -  N/2  I  >  Vd^NlogN  ]  <  N~° 

for  some  constant  d.  Let  5  (s,N)  be  the  expected  number  of  comparisons  required 
by  samplesort,  (X).  Since  the  [Floyd  and  Rivest  75]  selection  algorithm  requires 
only  N  +  o(N)  comparisons,  we  have  for  s(N)  =  N/\ogN, 

V(s(N),  N)  <  V(s(Ny),  NJ  +  N-*  N)N  +  o(N ) 

where 

Ny  =  N/2  +  (VdaN)logX. 

Solving  this  recurrence  [Frazer  and  McKeller  70]  show 
Z(s(N),  N)  «  NlogjN  , 

which  asymptotically  approaches  the  optimal  number  of  comparisons  required  to 
sort  N  numbers  on  the  comparison  tree  model. 

Let  Tp  ( s,N )  be  the  number  of  steps  on  a  parallel  comparison  tree  model 
with  p  processors  to  execute  samplesort,  (X)  where  Ixl  =  N.  Since  only  a  con¬ 
stant  number  of  steps  are  required  to  select  the  median  k  =  select(N/2,  A')  using 
N  processors,  [Reischuk  81]  observes  for  this  specialized  algorithm  with  s(N)  =  N, 

Tn  ( N,N )  <  0(1)  +  Tn/2  (N/2,  N/2 ) 
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<  O(logV). 


5.3  Multiple  Sorting 

Any  algorithm  with  »>  1  falls  under  this  category.  Let  us  call  cansort  as  mul¬ 
tisort  when  <>1.  As  was  shown  in  Lemma  3.3,  the  maximum  cardinality  of  any 
N 

subset  X{  is  <  — logJV  (=  Nt  ,tay)  with  probability  >  1  —  0(N~°).  Therefore,  if 

Tp  (N)  is  the  expected  parallel  comparison  time  for  executing  multisort,  (X)  with 
p  processors  (where  Ixl  =  N),  then, 

W  <  T,njn  m  +  N-°  T,  (N) 

+  T,  (s)  +  —  +  log(s) 

P 

<  ^VNJN  (Ni)  +  0(1)  +  ~  log(e). 

[Reischuk  82]  uses  the  specialization  a  =  N'/2  which  yields  the  following  recurrence 
for  Tp  (N). 

T»  <  TNi  (V.)  +  |log/V  +  0(1) 

=  0(log/V). 

Alternatively  we  can  set  p  =  Ni+t  and  «  =  Nl  for  any  0  <  £  <1  and  get  an 
iVj  =  Arl_</2VdalogAr  for  some  constant  d.  This  choice  of  5  yields  the  recurrence 

(N)  <  Vw,  M)  +  0(1)  +  N-^ogN 

<  Tn'n,  W  +  0(1) 

<  O(loglogV) 

6.4  FLASHSORT 

[Reif  and  Valiant  83]  give  a  method  FLASHSORT  for  dividing  X  into  even 
more  equal  sized  subsets.  This  method  is  useful  for  sorts  within  fixed  connection 
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networks,  where  the  processors  can  not  be  dynamically  allocated  to  work  on  vari¬ 
ous  size  subsequences.  The  idea  of  [Reif  and  Valiant  83]  is  to  choose  a  subsequence 
5  C  X  of  size  TV1/2,  and  then  choose  as  splitters  every  (ologiV)th  element  of  S  in 
sorted  order,  i.e.,  to  choose  it,-  =  select(oi  [logNj,  S)  for  i  —  l,2,...,TV'/2/logTV. 
Then  they  recursively  sort  each  subset  Xj  =  {zGX  /  <  x  <  Their  algo¬ 

rithm  runs  in  time  O(logTV)  and  they  have  shown  that  after  O(logTV)  recursive 
stages  of  their  algorithm,  the  subsets  will  be  of  size  not  more  than  a  factor  of  0(1) 
of  each  other. 
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